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1 Special issue: Al in engineering | 

D. Sriram, R. Joobbani 

January 1985 ACM SIGART Bulletin, issue 91 

Full text available: ^ pdf(8.79 MB) Additional Information: full citation , abstract 

The papers in this special issue were compiled from responses to the announcement In the 
July 1984 issue of the SIGART newsletter and notices posted over the ARPAnet. The interest 
being shown In this area is reflected in the sixty papers received from over six countries. 
About half the papers were received over the computer network. 

2 Fast detection of communication patterns in distributed executions | 
Thomas Kunz, Michiel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced Studies 
on Collaborative research 

Full text available:- ^ pdf(4.21 MB) Additional Information: full citation, abstract , references , index terms 

Understanding distributed applications is a tedious and difficult task. Visualizations based on 
process-time diagrams are often used to obtain a better understanding of the execution of 
the application. The visualization tool we use is Poet, an event tracer developed at the 
University of Waterloo. However, these diagrams are often very complex and do not provide 
the user with the desired overview of the application. In our experience, such tools display 
repeated occurrences of non-trivial commun ... 

3 Human-computer inteitace development: concepts and systems for its management | 
H. Rex Hartson, Deborah Hix 



March 1989 ACM Computing Surveys (CSUR), Volume 21 issue 1 

Additional Information: full citation , abstract , referenc 
terms, review 



Full text available: ^ Ddff7.97MB^ 



s. citings, index 



Human-computer interface management, from a computer science viewpoint, focuses on 
the process of developing quality human-computer interfaces, including their 
representation, design, implementation, execution, evaluation, and maintenance. This 
survey presents important concepts of interface management: dialogue independence, 
structural modeling, representation, interactive tools, rapid prototyping, development 
methodologies, and control structures. Dialogue independence is th ... 
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Andrew S. Tanenbaum, Robbert Van Renesse 

December 1985 ACM Computing Surveys (CSUR), Volume 17 issue 4 

Full text availabie:1 aaDdff5.49MBl Additional Information: full citation , abglri^, references , dtings. index 
terms, review 

Distributed operating systems have many aspects in common with centralized ones, but 
they also differ In certain ways. This paper is Intended as an introduction to distributed 
operating systems, and especially to current university research about them. After a 
discussion of what constitutes a distributed operating system and how it is distinguished 
from a computer network, various key design issues are discussed. Then several examples 
of current research projects are examined In some detail ... 

5 Pen computing: a technology overview and a vision | 

Andre Meyer 

July 1995 ACM SIGCHI Bulletin, Volume 27 issue 3 

Full text available: ^ pdf(5.14MB) Additional Information: full citation , abstract , citinos . index terms 

This work gives an overview of a new technology that Is attracting growing interest in public 
as well as in the computer industry itself. The visible difference from other technologies Is in 
the use of a pen or pencil as the primary means of interaction between a user and a 
machine, picking up the familiar pen and paper interface metaphor. From this follows a set 
of consequences that will be analyzed and put into context with other emerging 
technologies and visions.Starting with a short historic ... 

6 Software pipelining | 
Vlcki H. Allan, Reese B. Jones, Randall M. Lee, Stephen J. Allan 

September 1995 ACM Computing Surveys (CSUR), volume 27 issue 3 

rr II 4 ^ -I ui^ a ^MtA -yo RiiDv Additional Information: full citation , abstract , references, citings , index 
Full text avaiiaDie: T aO pqt(4.7^ mp) . 

tenms 

Utilizing parallelism at the instruction level is an important way to improve performance. 
Because the time spent in loop execution dominates total execution time, a large body of 
optimizations focuses on decreasing the time to execute each iteration. Software pipelining 
Is a technique that reforms the loop so that a faster execution rate is realized. Iterations 
are executed in overlapped fashion to increase parallelism. Let {ABC}n 
Keywords: Instruction level parallelism, loop reconstruction, optimization, software 
pipelining 



7 The NYU Ultracomputer— designing a IVIIMD. shared-memory parallel machine I 
(Extended Abstract) 

Allan Gottlieb, Ralph Grishman, Clyde P. Krusl<al, Kevin P. McAuliffe, Larry Rudolph, Marc Snir 
April 1982 Proceedings of the 9th annual symposium on Computer Architecture 

i- II * ^ I ui 01 ftiiDx Additional Information: full citation , abstract , references. citinQs. index 

Full text available: t Hpdf(1.36 MB) ^ ^ 

^ terms 

We present the design for the NYU Ultracomputer, a shared-memory I^II^D parallel machine 
composed of thousands of autonomous processing elements. This machine uses an 
enhanced message switching networic with the geometry of an Omega-network to 
approximate the ideal behavior of Schwartz's paracomputer model of computation and to 
Implement efficiently the important fetch-and-add synchronization primitive. We outline the 
hardware that would be required to build a 4096 processor system using 1990' ... 

8 Iterative schedule optimization for voltaae scalable distributed embedded systems 
Marcus T. Schmitz, Bashir AhHashimi, Petru Eles 

February 2004 ACM Transactions on Embedded Computing Systems (TECS), Volume 3 issue 
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1 

Full text available: ^ pdf(273.28 KB) Additional Information: full citation , abstract, references , index terms 

We present an iterative schedule optimization for multlrate system specifications, mapped 
onto heterogeneous distributed architectures containing dynamic voltage scalable 
processing elements (DVS-PEs). To achieve a high degree of energy reduction, we 
formulate a generalized DVS problem, taking into account the power variations among the 
executing tasks. An efficient heuristic is presented that identifies optimized supply voltages 
by not only "simply" exploiting slack time, but under the addition ... 

Keywords: Dynamic voltage scaling, embedded systems, energy minimization, 
heterogeneous distributed systems, scheduling, system synthesis 



9 The NYU ultracomputer — designing a MIMD. shared-memory parallel machine 

Allan Gottlieb, Ralph Grishman, Clyde P. Kruskal, Kevin P. McAuliffe, Larry Rudolph, Marc Snir 
August 1998 25 years of the international symposia on Computer architecture 
(selected papers) 

Full text available: ^ pdf(1.74 MB) Additional Information: full citation , references, index terms 



10 A high performance transparent bridge 

l^artlna Zitterbart, Ahmed N. Tantawy, Dimitrios N. Serpanos 

August 1994 IEEE/ACM Transactions on Networking (TON), volume 2 issue 4 

Full text available: ^ pdf(1.41 MB) Additional Infonnation: full citation, references, index terms 



Quei7 evaluation techniques for large databases 
Goetz Graefe 

June 1993 ACM Computing Surveys (CSUR), volume 25 issue 2 

Full text available' ^ pdf(9.37 IVIB) Additional Information: full citation, abstract , references, citings, index 
"^^'^ terms , review 

Database management systems will continue to manage large data volumes. Thus, efficient 
algorithms for accessing and manipulating large sets and sequences will be required to 
provide acceptable performance. The advent of object-oriented and extensible database 
systems will not solve this problem. On the contrary, modern data models exacerbate the 
problem: In order to manipulate large sets of complex objects as efficiently as today's 
database systems manipulate simple records, query-processi ... 

Keywords: complex query evaluation plans, dynamic query evaluation plans, extensible 
database systems, iterators, object-oriented database systems, operator model of 
parallelization, parallel algorithms, relational database systems, set-matching algorithms, 
sort-hash duality . 



^2 Cycles to recycle: garbage collection to the IA-64 

Richard L. Hudson, J. Elliot Moss, Sreenivas Subramoney, Weldon Washburn 

October 2000 ACM SIGPLAN Notices , Proceedings of the 2nd internatlonai symposium 

on Memory management, volume 36 issue i 
Full text available: ^pdf(1.25 MB) Additional Information: full citation , abstract , citings , index terms 

The IA-64, Inters 64-bit instruction set architecture, exhibits a number of interesting 
architectural features. Here we consider those features as they relate to supporting garbage 
collection (GC). We aim to assist GC and compiler Implementors by describing how one may 
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exploit features of the IA-64. Along the way, we record some previously unpublished object 
scanning techniques, and offer novel ones for object allocation (suggesting some simple 
operating system support that would simplify It ... 

13 Parallel execution of prolog programs: a survey 

Gopal Gupta, Enrico Pontelli, Khayri A.M. AN, Mats Carlsson, Manuel V. Hermenegildo 
July 2001 ACM Transactions on Programming Languages and Systems (TOPLAS), 

Volume 23 Issue 4 

Full text avallable: fB|Ddff1.95MB^ Additional Information: full citation , absliaa. reference?, siilngs. index 

terms 

Since the early days of logic programming, researchers in the field realized the potential for 
exploitation of parallelism present in the execution of logic programs. Their high-level 
nature, the presence of nondeterminism, and their referential transparency, among other 
characteristics, make logic programs interesting candidates for obtaining speedups through 
parallel execution. At the same time, the fact that the typical applications of logic 
programming frequently involve irregular computatio ... 

Keywords: Automatic parallelization, constraint programming, logic programming, 
parallelism, prolog 



1^ Technical correspondence 
CORPORATE Tech Correspondence 

October 1989 Communications of tlie ACM, volume 32 issue lo 

Full text available: ^ pdf(2.15MB) Additional Information: full citation, references, citings , index terms 



15 Atomic heap transactions and fine-arain interrupts | 
Olin Shivers, James W. Clark, Roland McGrath 

September 1999 ACM SIGPLAN Notices , Proceedings of the fourtli ACM SIGPLAN 

international conference on Functional programming, Volume 34 issue 9 

r- ^ ui 01 ^xf4Aizt.Aa^ Additional Information: full citation , abstract, references , citings, index 

Full text available: 1 pDdfn.45MB) ^ 

^ terms 

Languages such as Java, ML, Scheme, and Haskell provide automatic storage management, 
that is, garbage collection. The two fundamental operations performed on a garbage- 
collected heap are "allocate" and "collect." Because the heap is in an inconsistent state 
during these operations, they must be performed atomically. Otherwise, a heap client might 
access the heap during a time when its fundamental invariants do not hold, corrupting the 
heap.Standard techniques for providing this atomicity guaran ... 

16 Status report of the graphic standards planning committee 
Computer Graphics staff 

August 1979 ACM SXGGRAPH Computer Graphics, volume 13 issues 

Full text available: ^pdf(15.01 MBl Additional Infomiation: full citation, references , citinos 



17 The white dwarf: a hiah-performance application-specific processor 

A. Wolfe, M. Breternltz, G. Stephens, A. L. Ting, D. B. Kirk, R. P. Bianchini, J. P. Shen 
May 1988 ACM SIGARCH Computer Architecture News , Proceedings of the 15th 

Annual International Symposium on Computer architecture, volume 16 issue 2 
Full text available: 1 S|Ddf(1.40MB) Infomiation: full citation , abstrad. references , dfings. index 
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This paper presents the design and impiementation of a high-performance special-purpose 
processor, called The White Dwarf, for accelerating finite element analysis algorithms. The 
White Dwarf CPU contains two Am29325 32-bit floating-point processors and one Am29332 
32-bit ALU, and employs a wide-instruction word architecture in which the application 
algorithm is directly implemented in microcode. The entire system is VME-bus compatible 
and interfaces with a SUN 31160 host. The syste ... 

^8 Prefetching using Markov predictors 
Doug Joseph, Dirk Grunwald 

IMay 1997 ACM SIGARCH Computer Architecture News , Proceedings of the 24th 

annual international symposium on Computer architecture, volume 25 issue 2 
Full text available* ^ odfd 68 MB) Additional Information: full citation , abstract, references , citings, index 
• lo^' terms 

Prefetching is one approach to reducing the latency of memory operations in modern 
computer systems. In this paper, we describe the Markov prefetchen This prefetcher acts 
as an interface between the on-chip and off-chip cache, and can be added to existing 
computer designs. The Marlcov prefetcher is distinguished by prefetching multiple reference 
predictions from the memory subsystem, and then prioritizing the delivery of those 
references to the processor.This design results in a pr ... 

Novel ideas: A design space evaluation of grid processor architectures 

Ramadass Nagarajan, Karthil<eyan Sankaralingam, Doug Burger, Stephen W. Keckler 

December 2001 Proceedings of the 34th annual ACM/IEEE international symposium on 

M icroa rch itect u re 

Full text available: ^ .... St 

■[^pat(i,Z9MB)'!CT Additional Information: full citation , abstract, references , citings 

Publisher Site 

In this paper, we survey the design space of a new class of architectures called Grid 
Processor Architectures (CPAs). These architectures are designed to scale with technology, 
allowing faster clock rates than conventional architectures while providing superior 
instruction-level parallelism on traditional workloads and high performance across a range 
of application classes. A GPA consists of an array of ALUs, each with limited control, 
connected by a thin operand network. Programs are executed b ... 

20 The multicluster architecture: reducing cycle time through partitioning 
Keith I. Farkas, Paul Chow, Norman P. JouppI, Zvonko Vranesic 

December 1997 Proceedings of the 30th annual ACM/IEEE international symposium on 
Microarchitecture 

Full text available: p^f(i 44 jy/ip) ^ Additional Information: full citation , abstract, references , citings, index 

Publisher Site te™^ 

The multicluster architecture that we introduce offers a decentralized, dynamically- 
sclieduled architecture, in which the register files, dispatch queue, and functional units of 
the architecture are distributed across multiple clusters, and each cluster is assigned a 
subset of the architectural registers. The motivation for the multicluster architecture is to 
reduce the clock cycle time, relative to a single-cluster architecture with the same number 
of hardware resources, by reducing the size and ... 

Keywords: decentralized architecture, partitioned architecture, static instruction 
scheduling, register allocation 
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